Increasing confidence of protein interactomes using network topological metrics

نویسندگان

  • Jin Chen
  • Wynne Hsu
  • Mong-Li Lee
  • See-Kiong Ng
چکیده

MOTIVATION Experimental limitations in high-throughput protein-protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. RESULTS We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. AVAILABILITY The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website http://www.comp.nus.edu.sg/~chenjin/fpfn.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Increasing confidence of protein interactomes using network topological metrics Supplementary Materials APPENDIX A ALTERNATIVE PATHS IN PROTEIN-PROTEIN INTERACTIONS

First, we analyzed protein-protein interaction (PPI) datasets from three different species (Saccharomyces cerevisiae, Drosophila melanogaster, and Caenorhabditis elegans) to investigate the extent to which alternative paths are present in PPI datasets. We focus here only on interactomes that are derived by the popular high-throughput assays such as Y2H. Then, we provide some actual examples in ...

متن کامل

Increasing confidence of protein-protein interactomes.

High-throughput experimental methods, such as yeast-two-hybrid and phage display, have fairly high levels of false positives (and false negatives). Thus the list of protein-protein interactions detected by such experiments would need additional wet laboratory validation. It would be useful if the list could be prioritized in some way. Advances in computational techniques for assessing the relia...

متن کامل

Mining protein interactomes to improve their reliability and support the advancement of network medicine

High-throughput detection of protein interactions has had a major impact in our understanding of the intricate molecular machinery underlying the living cell, and has permitted the construction of very large protein interactomes. The protein networks that are currently available are incomplete and a significant percentage of their interactions are false positives. Fortunately, the structural pr...

متن کامل

Simple Topological Features Reflect Dynamics and Modularity in Protein Interaction Networks

The availability of large-scale protein-protein interaction networks for numerous organisms provides an opportunity to comprehensively analyze whether simple properties of proteins are predictive of the roles they play in the functional organization of the cell. We begin by re-examining an influential but controversial characterization of the dynamic modularity of the S. cerevisiae interactome ...

متن کامل

How and when should interactome-derived clusters be used to predict functional modules and protein function?

MOTIVATION Clustering of protein-protein interaction networks is one of the most common approaches for predicting functional modules, protein complexes and protein functions. But, how well does clustering perform at these tasks? RESULTS We develop a general framework to assess how well computationally derived clusters in physical interactomes overlap functional modules derived via the Gene On...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 22 16  شماره 

صفحات  -

تاریخ انتشار 2006